NLP QUIZ

Moodle Quiz XHTML Export

1

Which of the following techniques can be used for the purpose of keyword normalization, the process of converting a keyword into its base form?

  1. Lemmatization
  2. Levenshtein
  3. Stemming
  4. Soundex


  • 1 and 2

  • 2 and 4

  • 1 and 3

  • 1, 2 and 3

10

Which are python libraries used in NLP?     

  •   pandas

  • spacy

  •  nltk

  •  All of above

11

What is the name for the space inside which a robot unit operates?



  • danger zone

  •      spatial base

  • work envelop

  •    environment

12

What are the possible features of a text corpus Count of word in a document Boolean feature – presence of word in a document. Vector notation of word Part of Speech Tag Basic Dependency Grammar Entire document as a feature


  •  12345

  •   1234

  • 123

  •   12

13

Which step is the process of breaking down documents into smaller units of analysis?

     


  •  Ngrams

  • Tokenization

  •  Stopwords

  • Corpus

14

Which is a model of measuring the incidence of known words?


  •  A low weight in TF-IDF

  • A high weight in TF-IDF

  •   A corpus

  • A bag of words

15

Tick what is true about WordNet from the following sentences.



  • A machine-readable thesaurus, and aspects of a dictionary

  • A hierarchically organized lexical database

  •  Is a lexical database of semantic relations between words

  •  All of above

16

From the sentence “Fintech Online Course”, how many bigrams can be created?


  • 2

  • 3

  • 4

  • 1

17

Which function would you use to retrieve the list of unique words?


  •   fit_tranform()

  •  CountVectorizer()

  • get_feature_names()

  •  download()

18

Morphotacticsis a model of____________

     


  • How and which morphemes can be affixed to a stem

  •   Spelling modifications that may occur during affixation

  • N-grams of affixes and stems

  •  All affixes in the English language

19

Which is a high term frequency and low document frequency?


  • A low weight in TF-IDF

  •    A high weight in TF-IDF

  • A corpus



  • A bag of words

2

N-grams are defined as the combination of N keywords together. How many bi-grams can be generated from a given sentence:

“Analytics Vidhya is a great source to learn data science”




  • 7

  • 8
  • 9
  • 10

20

What were the objectives of NLP day 1?


  •  Implementing ngrams and word clouds

  • Tokenizing texts into sentences and words

  •  Implement lemmatization and stop wording

  • All of above

21

A vader compound score of 1.02 evaluates to__________


  •  positive sentiment

  •  neutral sentiment

  •  All of the above

  •    None of mentioned above

22

Which are common words usually removed in an NLP analysis?


  •  Corpus

  •  Tokenization

  • Stopwords

  • Ngrams

23

Which is the main Python package we use for NLP?


  •  NLTK

  •  NLP-LIB

  •    Scikit-Learn

  • PyNLP

24

Which of the following Affective States does Sentiment Analysis mostly focus on?

    


  • Personality Traits

  • Mood

  •  Emotion

  • Attitudes

25

Difficulties/Challenges in Word Sense Disambiguation (WSD) .Tick which is (FALSE) from the statements given below.


  •  Completely different algorithm might be needed for different applications

  •   to decide the sense of the word because different senses can be very closely related

  • The problem of Inter-judge variance as the WSD systems are generally tested by having their results on a task compared against the task of human beings

  • Words can be easily divided into discrete sub-meanings

26

How do we get from NLP text analysis to stock price correlation?


  • Convert parts of speech to categorical variables

  • Transform some NLP results into features

  •  Recognize some named entities

  •  All of above

27

Which is the process of turning different morphologies (i.e. versions) of a word into its base form?

 


  • Ngrams

  •   Tokenization

  • Lemmatization<br>

  • Stopwords

28

Which are included in named entity recognition?



  • Currency



  • Time and dates

  • Nouns

  •   All of above<br>

29

Which is a collection of documents?


  •  Corpus

  •   Ngrams

  •   Tokenization

  • Stopwords

3

How many trigrams phrases can be generated from the following sentence, after performing following text cleaning steps:

  • Stopword Removal
  • Replacing punctuations by a single space

“#Analytics-vidhya is a great source to learn @data_science.”






  • 3

  • 4
  • 5
  • 6

30

What does spaCy tagging do?



  • Identifies word order relationships

  •  Identifies more frequent words

  •   Identifies importance and relevance

  •  Identifies parts of speech

31

Which function would you use to implement a bag of words by creating a matrix of token counts?

    


  • fit_tranform()

  • CountVectorizer()

  • download()



  • get_feature_names()

32

.  ____________ is a Python library to make programs that work with natural language.

   


  •  Pandas

  •   NLTK

  •  BeautifulSoup

  • Seaborn

33

Tick whichever is an application Named Entity Recognition (NER)



  •   Efficient Search Algorithms

  • Analysis the rude behavior from customer feedback

  • Classifying content for NEWS providers

  •   All of above

34

What does NLP stand for? Write what the acronym means.

    


  • Netware Lite Protocol

  • Natural Language Processing<br>

  •  Nationwide Loan Processing

  • None of these

35

 In a word cloud, what does the size correspond to?


  • Importance



  •  Frequency<br>

  •   Length

  • Relation

36

N-grams are defined as the combination of N keywords together. How many bi-grams can be generated from given sentence:“NPTEL videos are a great source to learn engineering courses”



  • 6

  • 9

  • 7

  • 8

37

What are the names of lecturers in this NLP part?



  •   Sebastian Poliak

  • Stefan Kosztolanyi

  •    All of the above

  • None of above

38

Why we use named entity recognition in NLP?

   


  • Breaking sentences into words

  • Classify entities into predefined labels

  • Creating a set of vocabularies

  • None

39

Which of the following sense for the word “language” is not available in wordnet? i. a systematic means of communicating by the use of sounds or conventional symbol ii. communication by word of mouth iii. the cognitive processes involved in producing and understanding linguistic communication iv. the style of a piece of writing or speech v. the mental faculty or power of vocal communication


  • 4

  • 3

  • 1

  • 2

4

The Bag-of-Words approach_________


  •   keeps word order, keeps word multiplicity

  • keeps word order, disregards word multiplicity

  • disregards word order, disregards word multiplicity

  • disregards word order, keeps word multiplicity

40

Which are multiple word sequences?

  


  • Corpus

  •  Ngrams

  • Stopwords



  • Tokenization

41

What is the field of Natural Language Processing (NLP)?

    


  • Artificial Intelligence

  •    Linguistics

  •  Computer Science

  • All of the mentioned

5

Which company’s tone analyzer service did we discuss?



  •  Google

  • Apple



  •  Amazon

  • IBM

6

Which news sources did we use?


  •  Reuters

  • News API

  • All of above

  •   None of above

7

What were the objectives of NLP day 2?


  • Use NLTK and VADER to classify news as positive, negative, or neutral

  •  Perform data preparation techniques for sentiment analysis

  • Analyze sentiments and tone from news feeds

  • All of above

8

. Machine Translation is that convert_____________

    


  • Any human language to English

  • Human language to machine language

  •  One human language to another

  •  Machine language to human language

9

What kind of charts are used in visualizations of results?


  •  Scatterplot

  •   Barchart

  •   Windowplot

  • Both A & B

Post a Comment

0 Comments